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Noam Berger^’^ • Marek Biskup^ 

Quenched invariance principle for simple 
random walk on percolation clusters 


Abstract. We consider the simple random walk on the (unique) infinite cluster of super¬ 
critical bond percolation in Z'* with d > 2. We prove that, for almost every percolation 
configuration, the path distribution of the walk converges weakly to that of non-degenerate, 
isotropic Brownian motion. Our analysis is based on the consideration of a harmonic de¬ 
formation of the infinite cluster on which the random walk becomes a square-integrable 
martingale. The size of the deformation, expressed by the so called corrector, is estimated 
by means of ergodicity arguments. 


1. Introduction 

1.1. Motivation and model 

Consider supercritical bond-percolation on d > 2, and the simple random walk 
on the (unique) infinite cluster. In [38] Sidoravicius and Sznitman asked the fol¬ 
lowing question: Is it true that for a.e. configuration in which the origin belongs to 
the infinite cluster, the random walk started at the origin exits the infinite symmet¬ 
ric slab {{xi ,..., Xd)'. \xd\ < N} through the “top” side with probability tending 
to V 2 as W —> 00 ? Sidoravicius and Sznitman managed to answer their question 
affirmatively in dimensions d > 4 but dimensions d = 2,3 remained open. In this 
paper we extend the desired conclusion to all d > 2. As in [38], we will do so by 
proving a quenched invariance principle for the paths of the walk. 

Random walk on percolation clusters is only one of many instances of “sta¬ 
tistical mechanics in random media” that have been recently considered by physi¬ 
cists and mathematicians. Other pertinent examples include, e.g., various diluted 
spin systems, random copolymers [40], spin glasses [10,41], random-graph mod¬ 
els [9], etc. From this general perspective, the present problem is interesting for at 
least two reasons: First, a good handle on simple random walk on a given graph 
is often a prerequisite for the understanding of more complicated processes, e.g., 
self-avoiding walk or loop-erased random walk. Second, information about the 
scaling properties of simple random walk on percolation cluster can, in principle, 
reveal some new important facts about the structure of the infinite cluster and/or 
its harmonic properties. 
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Let us begin developing the mathematical layout of the problem. Let be 
the c?-dimensional hypercubic lattice and let be the set of nearest neighbor 
edges. We will use b to denote a generic edge, (x, y) to denote the edge between x 
and y, and e to denote the edges from the origin to its nearest neighbors. Let Q = 
{0,1}® be the space of all percolation configurations u = (u;{,)hgBd. Here Wf, = 1 
indicates that the edge b is occupied and Wf, = 0 implies that it is vacant. Let ^ 
be the Borel cr-algebra on 17—defined using the product topology—and let P be 
an i.i.d. measure such that P(wb = 1) = p for all b G B^. If x oo denotes the 
event that the site x belongs to an infinite self-avoiding path using only occupied 
bonds in co, we write ‘rfoo = ‘^oo (w) for the set 

‘^oo(w) = {a; G X oo}. (1.1) 

By Burton-Keane’s uniqueness theorem [12], the infinite cluster is unique and so 
is connected with P-probability one. 

For each x G Z"^, let Ta,: 17 ^ 17 be the “shift by x” defined by (TxLo)b = 
ujx+b- Note that P is r^-invariant for all x G Z'^. Let pc = pdd) denote the perco¬ 
lation threshold on Z'^ defined as the infimum of all p’s for which P(0 G “^oo) > 0. 
Let l7o = (0 G '^ooj and, forp > pc, define the measure Pq by 

Po(A) =P(A|f7o), Ae^. (1.2) 


We will use Eq to denote expectation with respect to Pq. 

For each configuration lo G 17o, let (X„)„>o be the simple random walk on 
'^oo(w) started at the origin. Explicitly, (7f„)„>o is a Markov chain with state 
space Z'^, whose distribution Pq.w is defined by the transition probabilities 


-fo.oj (7fn-t-l — X -f — x) — Ijoje —1} 


|e| = 1, 


(1.3) 


and 


f}l,a;(7fn+l — — x) — 


2(i 

e: \e\=l 


with the initial condition 

= 0 ) = 1 . 


(1.4) 


(1.5) 


Thus, at each unit of time, the walk picks a neighbor at random and if the corre¬ 
sponding edge is occupied, the walk moves to this neighbor. If the edge is vacant, 
the move is suppressed. 


1.2. Main results 

Our main result is that for Pp-almost every w G l7o, the linear interpolation of 
(X„), properly scaled, converges weakly to Brownian motion. For every T > 0, 
let {C[Q,T],Wt) be the space of continuous functions /: [0,T] —> R equipped 
with the cr-algebra Wt of Borel sets relative to the supremum topology. The precise 
statement is now as follows: 
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Theorem 1.1. Let d > 2, p > Pc{d) and let lo G l7o- (^n)n>o be the random 
walk with law Pq^uj ond let 

+ (in- t>0. (1.6) 

wn 


Then for all T > 0 and for Wg-almost every uj, the law of {Bn{t) '■ 0 < t < T) 
on (Cp, T], converges weakly to the law of an isotropic Brownian motion 
(Bt ■ 0 < t < T) whose diffusion constant, D = > 0, depends only on 

the percolation parameter p and the dimension d. 

The Markov chain (X„)„>o represents only one of two natural ways to define 
a simple random walk on the supercritical percolation cluster. Another possibility 
is that, at each unit of time, the walk moves to a site chosen uniformly at random 
from the accessible neighbors, i.e., the walk takes no pauses. In order to define 
this process, let {Tk)k>o be the sequence of stopping times that mark the moments 
when the walk {Xn)n>o made a move. Explicitly, Tq = 0 and 

Tn+i = inf{k > Tn'. Xk f Xk-i}, n > 0. (1.7) 


Using these stopping times—which are Po.w-almost surely finite for all to G l7o— 
we define a new Markov chain {X^)n>o by 

X; = Xt„, n>0. (1.8) 


It is easy to see that (Ar^)„>o has the desired distribution. Indeed, the walk starts 
at the origin and its transition probabilities are given by 


Po,ujiX'„ = x + e\X'„ = x) 


_ ^{uJe = l} _ 

: |e'| = l 


|e| = l. (1.9) 


A simple modification of the arguments leading to Theorem 1.1 allows us to es¬ 
tablish a functional central limit theorem for this random walk as well; 


Theorem 1.2. Let d > 2, p > Pc{d) and let lo G f2o. Let {X'^)„>q be the ran¬ 
dom walk defined from {Xn)n>o os described in (1.8) and let B!^{t) be the linear 
interpolation of {X'f,)o<k<n defined by (1.6) with (Xk) replaced by (X'f.). Then 
for all T > 0 and for Fo-almost every lo, the law of (B'fft): 0 < t < T) 
on (C7[0,T],#t) converges weakly to the law of an isotropic Brownian motion 
{Bt ■ 0 < t < T) whose diffusion constant, D' = E{\Bif) > 0, depends only on 
the percolation parameter p and the dimension d. 

De Gennes [17], who introduced the problem of random walk on percolation 
cluster to the physics community, thinks of the walk as the motion of “an ant in a 
labyrinth.” From this perspective, the “lazy” walk (Xn) corresponds to a “blind” 
ant, while the “agile” walk {X{f) represents a “myopic” ant. While the character of 
the scaling limit of the two “ants” is the same, there seems to be some distinction 
in the rate the scaling limit is approached, cf [22] and references therein. As we 
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will see in the proof, the diffusion constants D and D' are related via D' = D0^, 
where 0“^ is the expected degree of the origin normalized by 2d, cf (6.23). 

There is actually yet another way how to “put” simple random walk on 
and that is to use continuous time. Here the corresponding result follows by com¬ 
bining the CLT for the “lazy” walk with an appropriate Renewal Theorem for 
exponential waiting times. 

1.3. Discussion and related work 

The subject of random walk in random environment has a long history; we refer 
to, e.g., [10,42] for recent overviews of (certain parts of) this field. On general 
grounds, each random-media problem comes in two distinct flavors: quenched, 
corresponding to the situations discussed above where the walk is distributed ac¬ 
cording to an w-dependent measure Po,ai, and annealed, in which the path distri¬ 
bution of the walk is taken from the averaged measure A Eo(Po,u;(^))- Under 
suitable ergodicity assumptions, the annealed problem typically corresponds to the 
quenched problem averaged over the starting point. Yet the distinction is clear: In 
the annealed setting the slab-exit problem from Sect. 1.1 is trivial by the sym¬ 
metries of the averaged measure, while its answer is a priori very environment- 
sensitive in the quenched measure. 

An annealed version of our theorems was proved in the 1980s by De Masi et 
al [13,14], based on earlier results of Kozlov [28], Kipnis and Varadhan [27] and 
others in the context of random walk in a field of random conductances. (The re¬ 
sults of [13, 14] were primarily two-dimensional but, with the help of [3], they 
apply to all d > 2; cf [38].) A number of proofs of quenched invariance princi¬ 
ples have appeared in recent years for the cases where an annealed principle was 
already known. The most relevant paper is that of Sidoravicius and Sznitman [38] 
which established Theorem 1.2 for random walk among random conductances in 
alld > 1 and, using a very different method, also for random walk on percolation 
in d > 4. (Thus our main theorem is new only in d = 2,3.) The d > 4 proof 
is based on the fact that two independent random walk paths will intersect only 
very little—something hard to generalize to d = 2,3. As this paper shows, the 
argument for random conductances is somewhat more flexible. 

Another paper of relevance is that of Rassoul-Agha and Seppalainen [37] where 
a quenched invariance principle was established for directed random walks in 
(space-time) random environments. The directed setting offers the possibility to 
use independence more efficiently—every time step the walk enters a new environ¬ 
ment—but the price to pay for this is the lack of reversibility. The directed nature of 
the environment also permits consideration of distributions with a drift for which 
a CLT is not even expected to generally hold in the undirected setting; see [6,39] 
for an example of “pathologies” that may arise. 

Finally, there have been been a number of results dealing with harmonic prop¬ 
erties of the simple random walk on percolation clusters. Grimmett, Kesten and 
Zhang [20] proved via “electrostatic techniques” that this random walk is transient 
in d > 3; extensions concerning the existence of various “energy flows” appeared 
in [1,24, 26, 29, 33]. A great amount of effort has been spent on deriving esti¬ 
mates on the heat-kernel—i.e., the probability that the walk is at a particular site 
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Fig. 1. A portion of the infinite cluster ^txj = '^oo(oj) before (left) and after (right) the 
harmonic deformation x x + Here p = 0.75 is already so large that all but 

a few sites in the entire block belong to Upon the deformation, all “holes” (i.e., dual 
connected components) get considerably stretched and rounded while the “dangling ends” 
collapse onto the rest of the structure. 


after n steps. The first such bounds were obtained by Heicklen and Hoffman [23]. 
Later Mathieu and Remy [31] realized that the right way to approach heat-kernel 
estimates was through harmonic function theory of the infinite cluster and thus sig¬ 
nificantly improved the results of [23]. Finally, Barlow [3] obtained, using again 
harmonic function theory, Gaussian upper and lower bounds for the heat kernel. 
We refer to [3] for further references concerning this area of research. 

Note: At the time a preprint version of this paper was first circulated, we learned 
that Mathieu and Piatnitski had announced a proof of the same result (albeit in 
continuous-time setting). Their proof, which has in the meantime been posted [30], 
is close in spirit to that of Theorem 1.1 of [38]; the main tools are Poincare inequal¬ 
ities, heat-kernel estimates and homogenization theory. 

1.4. Outline 

Let us outline the main steps of our proof of Theorems 1.1 and 1.2. The princi¬ 
pal idea—which permeates in various disguises throughout the work of Papan- 
icolau and Varadhan [35], Kozlov [28], Kipnis and Varadhan [27], De Masi et 
al [13,14], Sidoravicius and Sznitman [38] and others—is to consider an embed¬ 
ding of (w) into the Euclidean space that makes the corresponding simple ran¬ 
dom walk a martingale. Formally, this is achieved by finding an R'^-valued discrete 
harmonic function on '^ao with a linear growth at infinity. The distance between 
the natural position of a site x € laoo and its counterpart in this harmonic embed¬ 
ding is expressed in terms of the so-called corrector x{x, oj) which is a principal 
object of study in this paper. See Fig. 1 for an illustration. 

It is clear that the corrector can be defined in any finite volume by solving an 
appropriate discrete Dirichlet problem (this is how Fig. 1 was drawn); the diffi- 
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cult part is to define the corrector in infinite volume while maintaining the natural 
(distributional) invariance with respect to shifts of the underlying lattice. Actually, 
there is an alternative, probabilistic definition of the corrector, 

x(x,uj)= liui {E^ ^{Xn) - Eo,tu{Xn))- (1-10) 

n—*oo 

However, the only proof we presently have for the existence of such a limit is by 
following, rather closely, the constructions from Sect. 2.3. 

Once we have the corrector under control, the proof splits into two parts: 
(1) proving that the martingale—i.e., the walk on the deformed graph—converges 
to Brownian motion and (2) proving that the deformation of the path caused by 
the change of embedding is negligible. The latter part (which is the principal con¬ 
tribution of this work) amounts to a sublinear bound on the corrector xix, uj) as a 
function of x. Here, somewhat unexpectedly, our level of control is considerably 
better in d = 2 than in d > 3. In particular, our proof in d = 2 avoids using 
any of the recent sophisticated discrete-harmonic analyses but, to handle all d > 2 
uniformly, we need to invoke the main result of Barlow [3]. The proof is actually 
carried out along these lines only for the setting in Theorem 1.1; Theorem 1.2 
follows by noting that the time scales of both walks are comparable. 

Here is a summary of the rest of this paper: In Sect. 2 we introduce the afore¬ 
mentioned corrector and prove some of its basic properties. Sect. 3 collects the 
needed facts about ergodic properties of the Markov chain “on environments.” 
Both sections are based on previously known material; proofs have been included 
to make the paper self-contained. The novel parts of the proof—sublinear bounds 
on the corrector—appear in Sects. 4-5. The actual proofs of our main theorems 
are carried out in Sect. 6. The Appendix (Sects. A and B) contains the proof of an 
upper bound for the transition probabilities of our random walk, further discussion 
and some conjectures. 

2. Corrector—construction and harmonicity 

In this section we will define and study the aforementioned corrector which is the 
basic instrument of our proofs. The main idea is to consider the Markov chain 
“on environments” (Sect. 2.1). The relevant properties of the corrector are listed in 
Theorem 2.2 (Sect. 2.2); the proofs are based on spectral calculus (see Sect. 2.3). 


2.7. Markov chain “on environments” 


As is well known, cf Kipnis and Varadhan [27], the Markov chain {Xn)n>o in 
(1.3-1.5) induces a Markov chain on Hq, which can be interpreted as the trajec¬ 
tory of “environments viewed from the perspective of the walk.” The transition 
probabilities of this chain are given by the kernel Q: Hq x ^ [0, 1], 


Q{u},A) 


1 


e: |e| = l 


l{TeWeA} T l{(Je=0} 




( 2 . 1 ) 


Our basic observations about the induced Markov chain are as follows: 
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Fig. 2. The harmonic deformation of a percolation configuration in the symmetric slab 
{(* 1 , 3 : 2 ) € : |a; 2 | < A^}. The star denotes the new position of the origin which in the 

undeformed configuration was right in the center. The relative vertical shift of the origin cor¬ 
responds to the deviation of Po,i.;(top hit before bottom) from one half. The figure also has 
an interesting electrostatic interpretation: If the bottom and top bars are set to potentials —1 
and -1-1, respectively, then the site with deformed coordinates (xi, * 2 ) has potential X2/N. 


Lemma 2.1. For every bounded measurable /: 17 —> K. and every e with |e| = 1, 

Eo(/orel{^,^i}) =Eo(/l{„_,=i}), (2.2) 

where —e is the bond that is opposite to e. As a consequence, Pq is reversible and, 
in particular, stationary for Markov kernel Q. 


Proof. First we will prove (2.2). Neglecting the normalization by P(0 G '^oo)^ we 
need that 

E(/ O Te l{0e<^oo} l{tae = l}) = E(/ l{0e<^oo} = (2-3) 

This will follow from = l{cu-e=i} °''’e '^he fact that, on {cce = 1} we 

have l{oe‘^oo} = Ifoe'^oo} Indeed, these observations imply 

/ o Te l{0e‘^oo} = = (/ IfOe'^oo} l{:.^-e = l}) o 'fe (2.4) 

and (2.3) then follows by the shift invariance of P. 

From (2.2) we deduce that for any bounded, measurable f,g: 17 ^ R, 

Eo(/(Q5)) =Eo(5(Q/)), (2.5) 


where Qf: 17 —> R. is the function 


(Q/)(w) 


1 


+ l{‘^e=0} /M) • 

e: |e| = l 


(2.6) 


Indeed, splitting the last sum into two terms, the second part reproduces exactly on 
both sides of (2.5). For the first part we apply (2.2) and note that averaging over e 
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allows us to neglect the negative sign in front of e on the right-hand side. But (2.5) 
is the definition of reversibility and, setting / = 1 and noting that Q1 = 1, we also 
get the stationarity of Pq. □ 

Lemma 2.1 underlines our main reason to work primarily with the “lazy” walk. 
For the “agile” walk, to get a stationary law on environments, one has to weigh Pq 
by the degree of the origin—a factor that would drag through the entire derivation. 


2.2. Kipnis-Varadhan construction 


Next we will adapt the construction of Kipnis and Varadhan [27] to the present 
situation. Let Po) be the space of all Borel-measurable, square 

integrable functions on 12. Abusing the notation slightly, we will use “L^” both 
for R-valued functions as well as K.'^-valued functions. We equip with the inner 
product (/, p) = Eo(/(/)—with “/p” interpreted as the dot product of / and g 
when these functions are vector-valued. Let Q be the operator defined by (2.6). 
Note that, when applied to a vector-valued function, Q acts like a scalar, i.e., inde¬ 
pendently on each component. 

From (2.5) we know 

if,Q9) = {Qf,g) (2.7) 

and so Q is symmetric. An explicit calculation gives us 


|(/,Q/)|<^ = f OTe)\ + {f,l{u,=0} f)] 


|e| = l 


- = /)| ^\{fA{ul-e = l} f)\ ^ + (/)l{we=0} 


|e| = l 


- ^ = l{‘^e=0} /) } = if, f) 

e: |e| = l 

and so HQIIls < 1. In particular, Q is self-adjoint and spec((5) C [—1,1]. 
Let V: 12 ^ be the local drift at the origin, i.e.. 


( 2 . 8 ) 


V(oj) 


1 


Y ^h^e = l}- 

e: \e\=l 


(2.9) 


(We will only be interested in V{u}) for lo G f2o, but that is of no consequence 
here.) Clearly, since V is bounded, we have V G L'^. For each e > 0, let 12 ^ 
be the solution of 

{l + e-Q)fi, = V. ( 2 . 10 ) 

Since 1 — Q is a non-negative operator, is well-defined and G for all 
e > 0. The following theorem is the core of the whole theory: 

Theorem 2.2. There is a function x: x 12o —> R'^ such that for every x G if, 

\\^l{^(,<^^}iip^oT^-ipf) =xix,-), inC^. (2.11) 


Moreover, the following properties hold: 
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(1) (Shift invariance) For Fo-almost every oj G 

x{x,uj) - x{y,^) = x{x - y,Ty{uj)) (2.12) 

holds for all x,y £ ^oo (w)- 

(2) (Harmonicity) For Fo-almost every to G Qq, the function 

X x(a^) + a; (2.13) 

is harmonic with respect to the transition probabilities (1.3-1.4). 

(3) (Square integrability) There exists a constant C < oo such that 

||[x(x + e,-) - x(a;,-)] l{a;e«’oo} ^ (2.14) 

is true for all x G and all e with \e\ = 1. 

The rest of this section is spent on proving Theorem 2.2. The proof is based 
on spectral calculus and it closely follows the corresponding arguments from [27]. 
Alternative constructions invoke projection arguments, cf [30,34]. 


2.3. Spectral calculations 

Let py denote the spectral measure of Q-. Lf ^ Lf associated with function V, 
i.e., for every bounded, continuous “F : [—1,1] ^ R, we have 

{V,F{Q)V) = J ^F{X)pv{dX). (2.15) 

(Since Q acts as a scalar, pv is the sum of the “usual” spectral measures for the 
Cartesian components of VI) In the integral we used that, since spec((5) G [—1,1], 
the measure pv is supported entirely in [—1,1]. The first observation, made al¬ 
ready by Kipnis and Varadhan, is stated as follows: 


Lemma 2.3. 



py(dA) < oo. 


(2.16) 


Proof. With some caution concerning the infinite cluster, the proof is a combi¬ 
nation of arguments right before Theorem 1.3 of [27] and those in the proof of 
Theorem 4.1 of [27]. Let / G be a bounded real-valued function and note that, 
by Lemma 2.1 and the symmetry of the sums, 

eEo(/l{^^=i}) = i Y eEo((/-/oTe) (2.17) 

e: |e|=l e: |e|=l 
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Hence, for every a e we get 


{f,a-V) = (e-a)Eo((/-/ore)l{^^=i}) 


2 2d 


e: |e| = l 


< 


1 / 1 


(We = 1 


(m E 

■e| = l 


(2.18) 


e: |e|=l 


The first term on the right-hand side equals a constant times |a|, while Lemma 2.1 
allows us to rewrite the second term into 


1 


e: |e| = l 



^ Eo(/(/-/ore)l{.,=i}) =2(/,(l-g)/). 

e: |e| = l 


(2.19) 


We thus get that there exists a constant Co < oo such that for all bounded / G 

|(/,a-V^)| <Co|a|(/,(l-Q)/)''^ (2.20) 

Applying (2.20) for / of the form f = a • ^{Q)V, summing a over coordinate 
vectors in K.'^ and invoking (2.15), we find that for every bounded continuous 
'P: [-1,1] ^ R and C = Co^/d, 


<f'(A)/ry(dA) 


<c(^y'(l-A)<f'(A)2py(dA) 


( 2 . 21 ) 


Substituting If'e(A) = (Ve) A for W and noting that (1 — A)'f'e(A) < 1, we get 


y’<f^,(A)py(dA) < C 


\ V 2 

<f"e(A)py(dA) j 


( 2 . 22 ) 


and so 

J ^,{X)^iv{dX)<C\ 

The Monotone Convergence Theorem now implies 


--r/rr/(dA) = sup 

J- — A e>0 


<f'e(A)py(dA) < C^ < 00 , 


(2.23) 


(2.24) 


proving the desired claim. □ 

Using spectral calculus we will now prove: 
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Lemma 2.4. Let t/’e be as defined in (2.10). Then 

lime||'i/'e||2 = 0. (2.25) 

eiO 

Moreover, for e with \e\ = 1 let ci"'’ = l{oe'^oo} l{we=i}(V’e °Te- tpe)- Then for 
all X & if and all e with \e\ = 1, 

lim oTx- o tJL = 0 (2.26) 

ei,e2i0 


Proof. The main ideas are again taken more or less directly from the proof of 
Theorem 1.3 in [27]; some caution is necessary regarding the containment in the 
infinite cluster in the proof of (2.26). By the definition of t/>e, 

^11^^112 = + (2.27) 


The integrand is dominated by and tends to zero as e J, 0 for every A in the 
support of fiy. Then (2.25) follows by the Dominated Convergence Theorem. 

The second part of the claim is proved similarly: First we get rid of the x- 
dependence by noting that, due to the fact that o Tx f 0 enforces x € ^oo, 
the translation invariance of P implies 

\\Gi^^^ oTx-G<f^KTx\\^< (2.28) 

Next we square the right-hand side and average over all e. Using that Ge 0 also 
enforces oje = 1 and applying (2.17), we thus get 


1 




2('0ei - ■062) (1 - Q){fei - fef)- 


(2.29) 


Now we calculate 




fe2, (1 - Q){lpei - Ipe^)) 

^ (ei-e2)^(l-A) 

(1 + ei - A)2(l -I- £2 - A)2 


py(dA). 


(2.30) 


The integrand is again bounded by for all ei,e 2 > 0, and it tends to zero 
as ei, £2 i 0. The claim follows by the Dominated Convergence Theorem. □ 

Now we are ready to prove Theorem 2.2: 

Proof of Theorem 2.2. Let G^S' o Tx be as in Lemma 2.4. Using (2.26) we know 
that G'f^ o Tx converges in as £ J, 0. We denote Gx,x+e = limejo G^f^ o Tx. 
Since G^t' o Tj, is a gradient field on '^oo, we have Gx,x+e{‘-f) + Gx+e,xipf) = 0 
and, more generally, X]fc=o — 0 whenever (xq, . •., x„) is a closed loop 

on “^oo- Thus, we may define 


n—1 

X(x,w) = '^Gxk,xk+A^), 


(2.31) 
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where {xq,xi, ..., a;„) is a nearest-neighbor path on connecting xq = 0 

to Xn = X. By the above “loop” conditions, the definition is independent of this 
path for almost every co G {x G ‘rfoo}- The shift invariance (2.12) now follows 
from this definition and Gx,x+e = Go,e o Xx. 

In light of shift invariance, to prove the harmonicity of a; x -f xixjCo) it 
suffices to show that, almost surely. 


1 


[x(0,-)-X(e,-)] 

e: |e| = l 


= V. 


Since x(e, •) — x(0, •) = Go,e, the left hand side is the e J, 0 limit of 


1 


e: \e\ = l 


OTe] = = (1 - Q)i’e- 


(2.32) 


(2.33) 


The definition of tells us that (1 — Q)'ipe = —e^/’e + V. From here we get (2.32) 
by recalling that tends to zero in L^. 

To prove the square integrability in part (3) we note that, by the construction 
of the corrector. 


[x{x + e, •) - x{x, •)] l{a;e«’cx,} l-{a;e = l} °Xx = Gx,x+e- (2.34) 

But Gx,x+e is the L^-limit of L^-functions o Tx whose L^-norm is bounded 
by that of Gi*^\ Hence (2.14) follows with G = maxg; |e|=i IjGo.elU- □ 


3. Ergodic-theory input 

Here we will establish some basic claims whose common feature is the use of 
ergodic theory. Modulo some care for the containment in the infinite cluster, all of 
these results are quite standard and their proofs (cf Sect. 3.2) may be skipped on 
a first reading. Readers interested only in the principal conclusions of this section 
should focus their attention on Theorems 3.1 and 3.2. 

3.1. Statements 

Our first result concerns the convergence of ergodic averages for the Markov chain 
on environments. The claim that will suffice for our later needs is as follows: 

Theorem 3.1. Let / G Po). Then for Fo-almost all u G fl, 

^ n—1 

lim - ./ o TXk (w) = Eo(/), Po,i...-almost surely. (3.1) 

n—*oo fl • ^ 

Similarly, iff: 17 x 17 —> M. is measurable with¥,o{Eo^^\f(uj,TXiix>)\) < oo, then 

^ n—1 

lim f{Tx^uj,Tx^+,u}) =Eo{Eo^^{f{uj,Tx,uj))) (3.2) 

n—^oo fl ' 

forF^-almost all uj and Po^uj-oltnost all trajectories of {Xk)k>o. 
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The next principal result of this section will be the ergodicity of the “induced 
shift” on l7o- To define this concept, let e be a vector with |e| = 1 and, for every 
oj G flo, let 

n{uj) = min{fc > 0: fee G ‘^oo(w)}- (3.3) 

By Birkhoff’s Ergodic Theorem we know that {k > 0: ke G '^ao} has posi¬ 
tive density in N and so n(w) < oo almost surely. Therefore we can define the 
map (Te : f?o ^ by 

(Je(u;) = r„(^)e W. (3.4) 

We call (Je the induced shift. Then we claim: 

Theorem 3.2. For every e with \e\ = 1, the induced shift ae- ^ is Po" 
preserving and ergodic with respect to Pq. 

Both theorems will follow once we establish of ergodicity of the Markov chain 
on environments (see Proposition 3.5). For finite-state (irreducible) Markov chains 
the proof of ergodicity is a standard textbook material (cf [36, page 51]), but our 
state space is somewhat large and so alternative arguments are necessary. Since 
we could not find appropriate versions of all needed claims in the literature, we 
include complete proofs. 

3.2. Proofs 

We begin by Theorem 3.2 which will follow from a more general statement. 
Lemma 3.3, below. Let {X, 3f, p) be a probability space, and let T: A” ^ X 
be invertible, measure preserving and ergodic with respect to p. Let A G be of 
positive measure, and define n: A—!-NU{oo}by 

n{x) = minjfc > 0: T^{x) G A}. (3.5) 

The Poincare Recurrence Theorem (cf [36, Sect. 2.3]) tells us that n(x) < oo 
almost surely. Therefore we can define, up to a set of measure zero, the map 
S: Ahy 

S'(x) x G A. (3.6) 

Then we have: 

Lemma 3.3. S is measure preserving and ergodic with respect to p{-\A). It is also 
almost surely invertible with respect to the same measure. 

Proof. (1) S is measure preserving: For j > 1, let Aj = {x G A-. n{x) = j}. 
Then the Aj’s are disjoint and p{A \ ljj>i ^j) = 0- First we show that 

ifj ^ n = 0. (3.7) 

To do this, we use the fact that T is invertible. Indeed, if a; G S{Ai) n >S'(gIj) 
for 1 < i < j, then x = T'^{y) = T^{z) for some y,z G A with n{y) = i 
and n{z) = j. But the fact that T is invertible implies that y = Tl~'^{z), which 
means n{z) < j — i < j, a. contradiction. To see that S is measure preserving, we 
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note that the restriction of S to Aj is T^, which is measure preserving. Hence, S 
is measure preserving on Aj and, by (3.7), on the disjoint union ljj>i well. 

(2) S is almost surely invertible: S'“^({a;})n {S' is well defined} is a one-point 
set by the fact that T is itself invertible. 

(3) S is ergodic: Let B G ^ he such that B Q A and 0 < IJ.{B) < fi{A). 
Assume that B is S-invariant. Then S”(a;) ^ A \ B for all x G i? and all n > 1. 
This means that for every x G B and every fc > 1 such that T^{x) G A, we have 
T^{x) ^ A \ i3. If follows that C = ljfc>i T^{B) is (almost-surely) T-invariant 
and /i(C') G (0,1), a contradiction with the ergodicity of T. □ 

Proof of Theorem 3.2. We know that the shift Te is invertible, measure preserving 
and ergodic with respect to P. By Lemma 3.3 the induced shift (Je: f2o ^ 
is Pg-preserving, almost-surely invertible and ergodic with respect to Pg. □ 

In the present circumstances. Theorem 3.2 has one important consequence: 
Lemma 3.4. Let B G be a subset of LIq such that for almost all to G B, 

G B) = l. (3.8) 

Then B is a zero-one event under Pg. 


Proo/The Markov property and (3.8) imply that Pg;j(rx„w G B) = Iforalln > 
1 and Pg-almost every lo G B. We claim that ae{uj) G B for Pg-almost every to G 
B. Indeed, let w G H be such that G B for all n > 1, Pg_;j-aImost surely. 
Let n(w) be as in (3.3) and note that we have n{uj)e G “ifoo- By the uniqueness of 
the infinite cluster, there is a path of finite length connecting 0 and n(aj)e. If £ is 
the length of this path, we have Bo,lu(A'£ = n(u’)e) > 0. This means that (Je(w) = 
Tn(uj)e(^) G he., B is almost surely (Te-invariant. By the ergodicity of the 
induced shift, B isa zero-one event. □ 

Our next goal will be to prove that the Markov chain on environments is er¬ 
godic. Let X = and define ^ to be the product cr-algebra on X; 

The space A is a space of two-sided sequences (..., w_i, wg, uji ,...)—the trajec¬ 
tories of the Markov chain on environments. (Note that the index on co is an index 
in the sequence which is unrelated to the value of the configuration at a point.) 
Let p be the measure on {X, 3f) such that for any B G 


— ni ■ • ■ j LOn) G B) 

~ I ni tlCcJ—yj+l) ' ' ' ticjyj), (3. 

J B 


9) 


where Q is the Markov kernel defined in (2.1). (Since Pg is preserved by Q, 
these finite-dimensional measures are consistent and p exists and is unique by 
Kolmogorov’s Theorem.) Clearly, ('rxfc(w))fc>g has the same law in Eg(Pg (j(-)) 
as (wg, wi,...) has in /r. Let T: X ^ X he the shift defined by (Tw)„ = Wn+i. 
Then T is measure preserving. 


Proposition 3.5. T is ergodic with respect to p,. 
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Proof. Let denote expectation with respect to /r. Pick A<Z X that is measurable 
and T -invariant. We need to show that 

m(A)g{0,1}. (3.10) 

Let /: 17 ^ K be defined as /(wo) = Eij,{1a |wo). First we claim that f = 1 a 
almost surely. Indeed, since A is T-invariant, there exist A^ G crioJk- k > 0) 
and A- G cr(wfc : k < 0) such that A and A± differ only by null sets from one 
another. (This follows by approximation of A by finite-dimensional events and 
using the T-invariance of A.) Now conditional on ojq, the event A+ is independent 
of a{ujk '■ k < 0) and so Levy’s Martingale Convergence Theorem gives us 

Efj,{lA Iwo) = Iwq) = |wo, W_i, . . . , tO-n) 

(3.11) 

— Efj, (lyt_ 1^01 , . . . , LO—yC} ^ 1A— — 1A; 

n—*oo 

with all equalities valid p-almost surely. 

Next let B C 17 be defined by i? = {wq: /(wq) = !}• Clearly, B is 1^- 
measurable and, since the wo-marginal of /i is Pq, 

^iiA) = Ef,if)=VoiB). (3.12) 

Hence, to prove (3.10), we need to show that 

Po(B)g{0,1}. (3.13) 

But A is T-invariant and so, up to sets of measure zero, if wq G i? then oji G B. 
This means that B satisfies condition (3.8) of Lemma 3.4 and so (3.13) holds. □ 

Now we can finally prove Theorem 3.1: 

Proof of Theorem 3.1. Recall that {TXk{‘^))k>o has the same law in Eo(To.(.j(-)) 
as (wo, wi, ... ) has in /r. Hence, if g{... wq, wi, ■ • ■) = /(“^o) then 

.J oo ^ oo 

lim — 'S^foTxi, = lim — goT^. (3.14) 

n —>^oo 77, • ^ ' n —>^00 77, * ^ 

The latter limit exists by Birkhoff’s Ergodic Theorem and (by Proposition 3.5) 
equals E^{g) = Eo(/) almost surely. The second part is proved analogously. □ 

4. Sublinearity along coordinate directions 

Equipped with the tools from the previous two sections, we can start addressing 
the main problem of our proof: the sublinearity of the corrector. Here we will prove 
the corresponding claim along the coordinate directions in if. 

Eix e with |e| = 1 and let n(w) be as defined in (3.3). Define a sequence nk{uj) 
inductively by ni(w) = n{uj) and nfe+i(u;) = nk{(Je{oj)). The numbers (rifc), 
which are well-defined and finite on a set of full Pg-measure, represent the suc¬ 
cessive “arrivals” of to the positive part of the coordinate axis in direction e. 
Let X be the corrector defined in Theorem 2.2. The main goal of this section is to 
prove the following theorem: 
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Theorem 4.1. For '¥^)-almost all oj e l7o. 


X{nkiuj)e,uj) ^ 

k—*oo k 


(4.1) 


The proof is based on the following facts about the moments of x(nfc(w)e, w): 
Proposition 4.2. Abbreviate Ve = Ve{oj) = ni{uj)e. Then 

( 1 ) Eo(|x(t^e,-)l) < oo. 

(2) ]Eo(x(ve,-)) = 0. 


The proof of this proposition will in turn be based on a bound on the tails of 
the length of the shortest path connecting the origin to v^- We begin by showing 
that \ve\ has exponential tails: 

Lemma 4.3. For each p > Pc there exists a constant a = a(p) > 0 such that for 
all e with |e| = 1, 

Po(|we| > n) < e““", n>l. (4.2) 


Proof. The proof uses a different argument in d = 2 and d > 3. In d > 3, we will 
use the fact that the slab-percolation threshold coincides with p^, as was proved 
by Grimmett and Marstrand [21]. Indeed, given p > p^, Isi K > 1 be so large 
that X {1,..., contains an infinite cluster almost surely. By the unique¬ 
ness of the percolation cluster in if, this slab-cluster is almost surely a subset 
of '^oo. Our bound in (4.2) is derived as follows: Let Ak be the event that at least 
one of the sites in {je: j = 1,K} is contained in the infinite connected com¬ 
ponent in X {1,.. .,K}. Then {|?;e| > Kn} n {0 G ^oo} C C\i<nTiKe{A). 
Since the events TiKe{A), f = 1,..., n, are independent, letting px = V{Ak) we 
have 

F{\ve\>Kn,0e‘^^)<{l-pKr, n>l. (4.3) 

From here (4.2) follows by choosing a appropriately. 

In dimension d = 2, we will instead use a duality argument. Let An be the 
box {1, ... ,n} X {1,... ,n}. On {|ve| > n} n {0 G ‘rfao}, none of the boundary 
sites {je: j = 1,..., n} are in ‘rfao- So either at least one of these sites is in a 
finite component of size larger than n or there exists a dual crossing of An in the 
direction of e. By the exponential decay of truncated connectivities (Theorem 8.18 
of Grimmett [19]) and dual connectivities (Theorem 6.75 of Grimmett [19]), the 
probability of each of these events decays exponentially with n. □ 

Our next lemma provides the requisite tail bound for the length of the shortest 
path between the origin and Ve : 

Lemma 4.4. Let L = L{ui) be the length of the shortest occupied path from 0 
to Ve- Then there exist a constant C < oo and a > 0 such that for every n > 1, 


Fo{L >n)< Ce-^". 


(4.4) 




Simple random walk on percolation clusters 


17 


Proof. Let (0, x) be the length of the shortest path from 0 to a; in configuration to. 
Pick e > 0 such that en is an integer. Then 

en 

{L > n} C {|ue| > U U{d ke) > n; 0, ke G'rfoo}■ (4.5) 

k=l 

In light of Lemma 4.3, the claim will follow once we show that the probability 
of all events in the giant union on the right-hand side is bounded by e““ " with 
some a' > 0 (independently of k). 

We will use the following large-deviation result from Theorem 1.1 of Antal 
and Pisztora [2]: There exist constants a,p < oo such that 

P(d„(0,a;) > p|x|) < e-“l^l (4.6) 

once |x| is sufficiently large. Unfortunately, we cannot use this bound in (4.5) di¬ 
rectly, because ke can be arbitrarily close to 0 (in £°° distance on Z'*). To circum¬ 
vent this problem, let We be the site —me such that m = min{m' > en: —m'eG 
and let A^.y = {d^{x,y) > ”/2, x,y € '^oo}- Then, on {di^(0,a:) > n}, 
either Iruej > 2en or at least one site “between” —2ene and —ene is connected 
to either 0 or ke by a path longer than "/ 2 . Since on {|we| > 2en} we must 
have |u_e o > en for at least one m = 1,... en, we have 


{di^(0, ke) > n; 0, fce € “^oo} 

en 

C ( U cr™g({|u_e| > en}) U IJ (Ao _ee U (4.7) 

m —1 en<^< 2 en 

Now all events in the first giant union have the same probability, which is expo¬ 
nentially small by Lemma 4.3. As to the second union, by (4.6) we know that 

Po(Ao,-ee) < e-“^ < e-“" (4.8) 

whenever e is so small that 4ep < 1, and a similar bound holds for Ake-ie as well 
(except that here we need 6ep < 1). The various unions then contribute a linear 
factor in n, which is absorbed into the exponential once n is sufficiently large. □ 

It is possible that a proper merge of the arguments in the previous two proofs 
might yield the same result without relying on Antal and Pisztora’s bound (4.6). 
(Indeed, the main other “external” ingredient of our proofs is Grimmett and Mar- 
strand’s paper [21] which lies at the core of [2] as well.) However, we find the 
argument using (4.6) conceptually cleaner and so we are content with the present, 
even though not necessarily optimal, proof. 

Next we state a trivial, but interesting technical lemma; 


Lemma 4.5. Let p > 1 and r G Suppose that Xi, X 2 , ■ ■ ■ are random 

variables such that supj>]^ l|7fj||p < 00 and let N be a random variable taking 
values in positive integers such that N £ for some s satisfying 


s > r 


1 + Vp 

1 - Vp' 


(4.9) 
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Then ^3 G Explicitly, 

N ri 1 1 

<C(sup||X,||p)(||iV|U)^^''’'-''^J, (4.10) 

^->1 

where C is a finite constant depending only on p, r and s. 

Proof. Let us define g G (1, 00 ) by Vp + Vq = 1- From the Holder inequality and 
the uniform bound on ||p we get 

N 

e\£^3 


n>l j—1 

< (supIIXjIIp)’' P{N = nf'o. 

n>l 

Under the assumption that N has s moments, we get 

P{N = (4.12) 

n>l ^n>l ^ 

by invoking the Holder inequality one more time. The first term on the right-hand 
side is finite whenever s obeys the bound (4.9). □ 

Proof of Proposition 4.2. Let x(a;,u;) be the corrector. By Theorem 2.2, on the 
set {x G ’ifoo}, x(^> •) is an L^-limit of functions Xe(x, •) = 0 x 3 ; — as e J, 0 . 

To prove that x(ve, •) G L^, recall the notation from Lemma 2.4 and let—as 
in Lemma 4.4 —L = L(oj) be the length of the shortest path from 0 to Ve. Then 

\XeiVe,Uj)\< *^4.13) 

\x\oo"^L{u)) e: |e| = l 

But Theorem 2.2 ensures that \\G^f '' o r 3;||2 < ||2 < C for all x and e 

and all e > 0, while the number of terms in the sum does not exceed N{uj) = 
2d{2L{uj) + 1)'*. By Lemma 4.4, N has all moments and so, by Lemma 4.5, 
supg>o IIXe(t^e, Ollr < oo for all T G [1, 2). In particular, x(t^e, •) G L^. 

In order to prove part (2), we first note that a uniform bound on L’'-norm 
of Xei^e, •) for some r > 1 implies that the family {xe(we, •)}e>o is uniformly 
integrable. Since Xe{ve,-) ^ xi^e,-) in probability, Xe{ve,-) ^ x(ve,-) in 
and it thus suffices to prove 

Eo(x.K,-)) =0, £>0. (4.14) 

This is implied by Theorem 3.2 and the fact Xe(ve,-) = fe o o'e — fe with 
absolutely integrable. □ 
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Proof of Theorem 4.1. Let /(w) = w), and let be the induced shift 

in the direction of e. Then we can write 

fc-i 

x(nfc(w)e,a;) = ^/ocrf(w). (4.15) 

l=Q 

By Proposition 4.2 we have f G and Eo(/) = 0. Since Theorem 3.2 ensures 
that (Te is Pg-preserving and ergodic, the claim follows from Birkhoff’s Ergodic 
Theorem. □ 


5. Sublinearity everywhere 

Here we will prove the principal technical estimates of this work. The level of con¬ 
trol is different in c? = 2 and d > 3, so we treat these cases separately. (Notwith¬ 
standing, the d> 3 proof applies in d = 2 as well.) 


5.7. Sublinearity in two dimensions 

We begin with an estimate of the corrector in large boxes in 7 ?: 


Theorem 5.1. Let d = 2 and let x be the corrector defined in Theorem 2.2. Then 
for f’^f-almost every to G f2o> 


lim max 

n^oo 

|a:|oo^^ 


\x{x,io)\ 

n 


= 0 . 


(5.1) 


The proof will be based on the following concept: 

Definition 5.2. Given K > 0 and e > 0, we say that a site x G is K, e-good 
(or just good) in configuration to G fl if x G '^oo(w) and 

\x{y,^)-x{x,i^)\<K + e\x-y\ (5.2) 

holds for every y G ‘^oo(w) of the form y = £e, where i G 'll and e is a unit 
coordinate vector. We will use to denote the set of K, e-good sites 

in configuration to. 


On the basis of Theorem 4.1 it is clear that for each e > 0 there exists nK < oo 
such that the Po(0 G ifx.e) > 0. Our first goal is to estimate the size of the largest 
interval free of good points in blocks [—n, n\ on the coordinate axes: 

Lemma 5.3. Let e be one of the principal lattice vectors in 'I? and, given e > 0, 
let K be so large that Po(0 G > 0. For alln > 1 and to G fl, let yo <■■■ < 

yr be the ordered set of all integers from [—n, n] such that yie G Fet 

A„(w) = max (yj-yj^i). (5.3) 

7=1, 

(If no such yi exists, we define An(to) = oo.) Then 

A„ 

lim -= 0, 

n—^oo 77, 


P-almost surely. 


(5.4) 
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Fig. 3. An illustration of the main idea of the proof of Theorem 5.1. Here a square of side n 
is intersected hy a grid G of good lines “emanating” from the good points on the x and y 
axes. The crosses represent the points on these lines which are in ^oo. Along the good lines 
the corrector grows slower than linear and so anywhere on G sublinearity holds. For the 
part of ^oo that is not on G, the maximum principle for a; a; + x(®) 1®!^ us bound the 

corrector by the values on the parts of the grid that surround it, modulo factors of order o(n). 


Proof. Since P is Te invariant and Te is ergodic, we have 
1 " 

lim —— V l{oe»K e} °'^e = ^(0 G '^K.e) (5.5) 

n^oo n + 1 ^' 
fc =0 

P-almost surely. A similar statement applies to the limit n —oo. But if Anin 
does not tend to zero, at least one of these limits would not exist. □ 

Proof of Theorem 5.1. Fix e € (0, V 2 ) and let Kq be such that P(0 € ifx.e) > 0 for 
all K > Kq (we are using that increases with K). Let 17 q C Aq be the set 
of configurations such that the conclusion of Lemma 5.3 applies for both x and y- 
axes, and that shift-invariance (2.12) holds for all x,y in the infinite cluster. We 
will show that for every a; G the limsup in (5.1) is less than 6 e almost surely. 

Let ei and 62 denote the coordinate vectors in If. Fix u; G I7g and ad¬ 
just K > Kq so that 0 G (This is possible by the definition of 17 q.) Then we 
define {xk)k&i, to be the increasing two-sided sequence of all integers such that 
XkCi exhausts all K, e-good points on the ei-axis, i.e., 

XkCi G k G Z. (5.6) 

If An be the maximal gap between consecutive Xj ’s that lie in [—n, n], cf (5.3), we 
define ni{uj) be the least integer such that A„/n < e for all n > ni{uj). Similarly 
we identify a two-sided increasing sequence {yn)nGZ of integers exhausting the 
sites such that 


yke2 G fc G Z, 


(5.7) 
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and let 712 ( 0 ;) be the quantity corresponding to ni(o;) in this case. 

Let no = max{ni, n 2 }. We claim that for all n > 779 ( 0 ;), 

max |x(a;,o;)| < 2iL + Gen. (5.8) 

\x\oo^'fT' 

To prove this, let us consider the grid G = G(o;) of good lines 


{XfcCi -1- 7762 : 

77 G Z}, 

fc G Z, 

(5.9) 

{7761 +yke2- 

77 G Z}, 

k ez, 

(5.10) 


see Fig. 3. As a first step we will use the harmonicity of x x + x(a;, o;) to deal 
with X € \ G. Indeed, any such x is enclosed between two horizontal and 

two vertical grid lines and every path on ‘rfoo connecting x to “infinity” necessarily 
intersects one of these lines at a point which is also in “^oo- Applying the maximum 
(and minimum) principle for harmonic functions we get 


max |x(a;,o;)|<2e7i+ max |x(a;,o;)|. (5.11) 

|x|oo<?^ |x|oo<2n 


Here we used that the enclosing lines are not more than < 2en < n apart 

and, in particular, they all intersect the block [—2r7, 277 ] x [—2r7, 277 ]. 

To estimate the maximum on the grid, we pick, say, a horizontal grid line 
with y-coordinate yk and note that, by (2.12), for every x G *^00 on this line, 

X{x,uj) - x(2/fce2,u;) = x{x - yfcCa, (5-12) 

By (5.7) and the fact that x — 7/^62 G we have 

|x(a;,u;) - x(2/fee2,a;)| < a: + 2e77 (5.13) 

whenever x is such that |x|oo < 277. Applying the same argument to the vertical 
line through the origin, and x replaced by yke 2 , we get 

\x{x,uj)\<2KAen (5.14) 

for every x G *^00 H G with |x|oo < 2r7. Combining this with (5.11), the esti¬ 
mate (5.8) and the whole claim are finally proved. □ 

Interestingly, a variant of the above strategy for controlling the corrector in 
d = 2 has independently been developed by Chris Hoffman [25] to control the 
geodesics in the first-passage percolation on Z^. 
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5.2. Three and higher dimensions 

In fi > 3 we have the following weaker version of Theorem 5.1: 


Theorem 5.4. Let d > 3. Then for all e > 0 and Fo-almost all oj, 


lim sup 

n—^OO 


1 

(2n + 1)^ 


l{|x(a;,w)|>en} 

(^) 

|£c|<n 


= 0 . 


(5.15) 


Here we fix the dimension d and run an induction over ^-dimensional sections 
of the d-dimensional box {a; € Z^: I x\ < n}. Specifically, for each i/ = 1,... ,d, 
let A'^ be the i^-dimensional box 

= {kiei -h Kci,: ki e Z, \ki\ < nWi = I,... ,iy}. (5.16) 


The induction eventually gives (5.15) for v = d thus proving the theorem. 

Since it is not advantageous to assume that 0 G ‘^oo, we will carry out the proof 
for differences of the form x{x, u) — x(2/, w) with x,y G '^oo- For each u G fi, 
we thus consider the (upper) density 


q^(lo) = lim lim sup inf ——- 

^10 ji —.oo C^)nyl^ l^l^l 




1 




(5.17) 


Note that the infimum is taken only over sites in one-dimensional box yl^. Our 
goal is to show by induction that q,j = 0 almost surely for all = 1,... ,d. The 
induction step is encapsulated into the following lemma: 


Lemma 5.5. Let 1 < v < d. If Qu = Q, ¥-almost surely, then also g^+i = 0, 
V-almost surely. 


Before we start the formal proof, let us discuss its main idea: Suppose that 
Pi/ = 0 for some v < d, P-almost surely. Pick e > 0. Then for P-almost every uj 
and all sufficiently large n, there exists a set of sites Z\ C fl such that 

\(A-;^n‘^^)\A\<e\A-;^\ (5.18) 

and 

|x(a:,w) - x(j/,w)| < en, x,y G A. (5.19) 

Moreover, n sufficiently large, Z\ could be picked so that Z\ n 21^ 7 ^ 0 and, assum¬ 
ing K ^ 1, the non-iT, e-good sites could be pitched out with little loss of density 
to achieve even 

^ C (5.20) 

(All these claims are direct consequences of the Pointwise Ergodic Theorem and 
the fact that P(0 G ifx.e) converges to the density of as AT ^ oo.) 

As a result of this construction we have 


X(^,u;) - x(a:,w) < K + en 


(5.21) 
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Fig. 4. The main idea underlying the proof of Theorem 5.4. The figure on the left represents 
an n X n square in a two-dimensional plane in Z®; the crosses now stand for good sites; cf 
Definition 5.2. Here L is chosen so that (1 — 5)-fraction of all vertical lines find a good point 
on the intersection with one of the L horizontal lines; n is assumed so large that every pair 
of these lines has two good points “above” each other. Any two good points x and y in the 
square are connected hy broken-line path that uses at most 4 good points in between. The 
dashed lines indicate the vertical pieces of one such path. The figure on the right indicates 
how this is used to control the difference of the corrector at two general points r,s £ ^oo 
in an n X n X n cube in Z®—with obvious extensions to all d > 3. 


for any x £ A and any 2 ; € n of the form x + je^+i- Thus, if r, s € 

“^oo n are of the latter form, r = x + je^+i and s = y + ke,j+i —see Fig. 4 
for an illustration—then (5.21) implies 

|x(r,w) - X(s,w)| < \xix,uj) - xiy,u))\ + 2K + 2en. (5.22) 

Invoking the “induction hypothesis” (5.19), the right-hand side is less than 2K -|- 
3en, implying a bound of the type (5.19) but one-dimension higher. 

Unfortunately, the above is not sufficient to prove (5.19) for all but a vanishing 
fraction of all sites in The reason is that the r’s and s’s for which (5.22) holds 
need to be of the form x + j e,y+i for some x £ An ‘rfoo ■ But will occupy only 
about Poo = P(0 G '^ 00 ) fraction of all sites in and so this argument does not 
permit us to control more than fraction about Poo of 2lJ^+^ n '^oo- 

To fix this problem, we will have to work with a “stack” of translates of A'^ at 
the same time. (These correspond to the stack of horizontal lines on the left of of 
Fig. 4.) Explicitly, consider the collection of rz-boxes 

J = (5.23) 

Here L is a deterministic number chosen so that, for a given 5 > 0, the set 

Ao = {x£A'^:3j £{0,...,L-1},x+ je,+i G A^^^^ n ^ 00 } (5.24) 


is so large that 


|z\o|>(i-5)K| 


(5.25) 
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once n is sufficiently large. These choices ensure that (1 — 5)-fraction of A'^ is now 
“covered” which by repeating the above argument gives us control over x(r, to) for 
nearly the same fraction of all sites r G A^'^^ n ‘^oo- 

Proof of Lemma 5.5. Let v < d and suppose that = 0, P-almost surely. Fix <5 
with 0 < (5 < and let L be as defined above. Choose e > 0 so that 

Le + 5<^Pl. (5.26) 

For a fixed but large K, and P-almost every w and n exceeding an w-dependent 
quantity, for each j = 1, ..., L, we can find Aj C 2lJ( ^ n '^oo satisfying the 
properties (5.18-5.20)—with replaced by ^ Given Z\i,..., Al, let A be 
the set of sites in IT whose projection onto the linear subspace IH = 

{fciei + • • • + /Ci/Ci/: fci G Z} belongs to the corresponding projection of Z\i U 
• • • U Z\l. Note that the Aj could be chosen so that Af] 

By their construction, the projections of the Aj’s, j = 1,..., L, onto H “fail 
to cover” at most Le|2ljj| sites in Z\o, and so at most (S + Le)|yljj| sites in A'^ are 
not of the form x + for some x G IJ^ Aj. It follows that 

|(ylj;+in-^oo)\^| < (S + Le)IA^+^l, (5.27) 

i.e., A contains all except at most (Le + 5)-fraction of all sites in that we care 
about. Next we note that if K is sufficiently large, then for every 1 < i < j < L, 
the set H contains at least -fraction of sites x such that 

= X + iciy G and zj x + je^ G (5.28) 

Since we assumed (5.26), once n ^ 1, for each pair (i,j) with ^ < i < 3 < L 

such Zi and Zj can be found so that Zi G Ai and Zj G Aj. But the Z\j’s were 
picked to make (5.19) true and so via these pairs of sites we now show that 

\x{y,^) - xix,ui)\ < K + eL + 2en (5.29) 

for every x,y G Z\i U • • • U see again (the left part of) Fig. 4. 

From (5.19) and (5.29) we now conclude that for all r,s G A, 

\x{r, u) — x(s, w) I < 3iT + eL + 4en < 5en, (5.30) 

provided that en > 376 + eL. If denotes the right-hand side of (5.17) before 

taking e J, 0, the bounds (5.27) and (5.30) and Af] A\f^% yield 

Qv+i, 5 e{^) "A 5 + Le, (5.31) 

for P-almost every lo. But the left-hand side of this inequality increases as e J, 
0 while the right-hand side decreases. Thus, taking e J, 0 and 5 J, 0 proves 
that Pu+i = 0 holds P-almost surely. □ 

Proof of Theorem 5.4. The proof is an easy consequence of Lemma 5.5. First, by 
Theorem 4.1 we know that (w) = 0 for Pp-almost every lo. Invoking appropriate 
shifts, the same conclusion applies P-almost surely. Using induction on dimension. 
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Lemma 5.5 then tells us that Qd{<^) = 0 for Po-almost every uj. Let lo G f2g. By 
Theorem 4.1, for each e > 0 there is ng = ng{uj) with Po(no < oo) = 1 such 
that for all n > no(w), we have \x{x,ijj)\ < en for all x G (oj). Using 

this to estimate away the infimum in (5.17), the fact that Qd = 0 now immediately 
implies (5.15) for all e > 0. □ 

6. Proof of main results 

Here we will finally prove our main theorems. First, in Sect. 6.1, we will show the 
convergence of the “lazy” walk on the deformed graph to Brownian motion and 
then, in Sect. 6.2, we use our previous results on corrector growth to extend this 
to the walk on the original graph. This separation will allow us to treat the parts 
of the proof common for d = 2 and d > 3 in a unified way. Theorem 1.2, which 
concerns the “agile” walk, is proved in Sect. 6.3. 

6.1. Convergence on deformed graph 

We begin with a simple observation that will drive all underlying derivations: 

Lemma 6.1. Fix uj G fig and let x x(a:, ui) be the corrector. Given a path of 
random walk (2f„)„>o with law Pq.w. 

+ xiXn,tu), n > 0. (6.1) 

Then is an -martingale for the filtration {cr{Xg,..., Xn))n>o- More¬ 
over, conditional on Xkg = x, the increments — M^^'^)k>g have the same 

law as 

Proof. Since is bounded, x(2f„,w) is bounded and so is square inte- 

grable with respect to To,w- Since x x -\- x{x,uj) is harmonic with respect to 
the transition probabilities of the random walk (2f„) with law Pq.w. we have 

Eg,^ I a(X„)) = , n > 0, (6.2) 

Po,w-almost surely. Since is (T(X„)-measurable, is a martingale. 

The stated relation between the laws of — M^'^)k>g and (M^ is 

implied by the shift-invariance (2.12) and the fact that is a simple random 

walk on the deformed infinite component. □ 

Next we will establish the convergence of the above martingale to Brownian 
motion. The precise statement is as follows: 

Theorem 6.2. Let d > 2, p > and oj G fig. Let (2f„)„>o be the random walk 
with law Pg^uj ond let {M^'^)n>g be as defined in (6.1). Let (B^n\f): t > 0) be 
defined by 

- 


f>0. (6.3) 
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Then for all T > 0 and Wo-almost every u, the law of {Bn{t): 0 < t < T) 
on (C[0,T],Wf) converges weakly to the law of an isotropic Brownian motion 
{Bt : 0 < t < T) with diffusion constant D, i.e., E{B^) = Dt, where 

D = G(0,oo). (6.4) 


Proof Without much loss of generality, we may confine ourselves to the case 
when T = 1. Let = ct^Xq, Xi,..., Xf) and fix a vector a G K.'^. We will 
show that (the piece-wise linearization) of f scales to one-dimensional 

Brownian motion. For m < n, consider the random variable 



■E 


El 


'0,a; 


([a-(M, 


(^) 
fc + 1 


Mt^)] 





(6.5) 

In order to apply the Lindeberg-Feller Functional CLT for martingales (Theo¬ 
rem 7.7.3 of Durrett [15]), we need to verify that for Po-almost every uj, 

^ Po,w-probability for all t G [0,1] and some C G (0, oo). 
(2) Vn'il (e) ^ 0 in Po,w-probability for all e > 0. 

Both of these conditions will be implied by Theorem 3.1. Indeed, by the last con¬ 
clusion of Lemma 6.1 we may write 


^ 7/6 

14^!^ (e) = - E ° 

’ n ^ 

where 

//.H (6.7) 

Now if e = 0, Theorem 3.1 tells us that, for Pp-almost every w, 

£r^Li;^;,)(0) =Eo(L;o..([a-M(“)]2)) = ip|a|4 (6.8) 

where we used the symmetry of the joint expectations under rotations by 90°. 
From here condition (1) follows by scaling out the f-dependence first and working 
with tn instead of n. 

On the other hand, when e > 0, we have < fK once n is sufficiently 
large and so, Pp-almost surely, 

limsupLi“)(e)<Eo(^^o.u.([a-M}“^]2L ^ 0, (6.9) 

n—^00 II 1 i_ j / fi —^00 

where to apply Dominated Convergence we used that a ■ G L'^. Hence, 

the above conditions (1) and (2) hold—in fact, even with limits taken Po,uj-almost 
surely. Applying the Martingale functional CLT and the Cramer-Wold device (The¬ 
orem 2.9.2 of [15]), we conclude that, for Pp-almost every co, the linear interpola¬ 
tion of the sequence /y^)k=i^...^n converges to isotropic Brownian motion 

with covariance matrix 1. 
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To make the proof complete, we need to show that D G (0, oo). Here the finite¬ 
ness is immediate by the square-integrability of x- The positivity can be shown in 
many ways: either by a direct computation from (6.4) using that Eo(ii^o,w(-^i • 
x(^i,oj)) = 0 [which in turn is implied by Eo(x(e,u;) l{a;,,=i}) = 0 for ev¬ 
ery coordinate vector e] or by invoking the sublinearity of the corrector proved in 
Theorems 5.1-5.4, or by an appeal to the lower (or, alternatively, upper) bound 
in [3, Theorem 1]. □ 

6.2. Correction on the corrector 

It remains to estimate the influence of the harmonic deformation on the path of 
the walk. As already mentioned, while our proof in d = 2 is completely self- 
contained, for d > 3 we rely heavily on (a discrete version of) the sophisticated 
Theorem 1 of Barlow [3]. 

Let us first dismiss the two-dimensional case of Theorem 1.1: 

Proof of Theorem 1.1 (d = 2). We need to extend the conclusion of Theorem 6.2 
to the linear interpolation of (2f„). Since the corrector is an additive perturbation 
of it clearly suffices to show that, for Po-almost every uj, 

max in Po,w-probability. (6.10) 

l<fe<n yn n—^oo 

By Theorem 5.1 we know that for every e > 0 there exists a K = K{uj) < oo 
such that 

\x{x,uj)\ < K + e\x\^, xG^oo{(^)- (6.11) 

If e < 1 / 2 , then this implies 

\x{Xk,tu)\<2K + 2e\Ml^'>\. (6.12) 

But the above CLT for (Mk) tells us that maxfe<„ \Mjf '^\/y/n converges in law to 
the maximum of a Brownian motion B{t) over t € [0,1]. Hence, if P denotes the 
probability law of the Brownian motion, the Portmanteau Theorem (Theorem 2.1 
of [7]) allows us to conclude 

limsupPo.tj(max|x(2ffc,w)| > Sy/n) < p( max \B{t)\ > (6.13) 

k<n \ 0<t<l 26/ 

The right-hand side tends to zero as e J, 0 for all 5 > 0. □ 

In order to prove the same result in d > 3, we will need the following upper 
bounds on the transition probability of our random walk: 

Theorem 6.3. (1) There is a random variable C = C{lo) with Po(C' < 00 ) = 1 
such that for all lu G Hq and all x G ^c>o(oj), 

Po,uj(.Xn = x)< , 


n > 1. 


(6.14) 
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(2) There are constants Ci, C 2 € (0, 00 ) and random variables = Nx{uj) such 
that for all oj € f2o, all x G (w), all R> 1, and all n > Nx (w), 

Px,ui{\Xn -x\> R) < Cl exp{-C 2 i?^/n}. (6.15) 

Moreover, the random variables (Nx) have stretched-exponential tails, i.e., there 
exist constants C 3 > 0 and 9 > 0 such that for all x G Tf, 

¥o{Nx > R)< R>1. (6.16) 

For a continuous-time version of our walk, these bounds are the content of 
Theorem 1 of Barlow [3]. (In fact, the continuous-time version of the bound (6.14) 
was obtained already by Mathieu and Remy [31].) Unfortunately, to derive The¬ 
orem 6.3 from Barlow’s Theorem 1, one needs to invoke various non-trivial facts 
about percolation and/or mixing of Markov chains. In Appendix A we list these 
facts and show how to assemble all ingredients together to establish the above 
upper bounds. 

Proof of Theorem 1.1 (d > 3). We will adapt (the easier part of) the proof of 
Theorem 1.1 in Sidoravicius and Sznitman [38]. First we show that the laws of 
{Bn(t): t < T) on (([/[O, T], Wt) are tight. To that end it suffices to show (e.g., 
by Theorem 8.6 of Ethier-Kurtz [16]) that if is the class of all stopping times 
of the filtration {a{{Bn{s): s < t}))o<t<T, then 

limsup limsup sup i?o.w(|S„(r + e) — i3„(r)p) = 0. (6.17) 

ej.0 n-i-oo 

As in [38], we replace t by its integer-valued approximation. Explicitly, let t = 
[nrj + 1 and let 5 be a number such that n6 = [ne\ -f 1. Since f differs from nr 
by a constant of order unity, and similarly for t + n6 and n{T -f e), we have 

\B^{t + e) - i?„(r)| < -^\Xx+nS - Xx\ + ^ (6.18) 

s/n y/n 

for some constant C 4 < 00 . This allows us to estimate (6.17) by means of the 
second moment of \Xf^nS — Xf\. 

Recalling that t < T, we may assume that f < 2Tn. By (6.16) we know 
that there exists an almost-surely finite random variable C = C'{lo) such that 
max| 2 ,|<fl. A/r < C"(u;)(log 7?)“’ once R > 2, where C = Since iXfl < 
2Tn, this implies that Nx^ < C"(w)[log(2Tn)]‘’. Theorem 6.3(2) and the strong 
Markov property —t is a stopping time of the random walk — tell us that, for some 
constant C 5 < 00 (depending only on Ci, C 2 and the dimension), 

Eo,u;{\Xf+nS - Xf\'^) < c^en, n > no(w). (6.19) 

Here we used e — 6 = 0(Vn) and let no(cc) be such that Sn > C"(w)[log(2Tn)]^ 
for all n > riQ. The bound (6.17) is now proved by combining (6.18-6.19) and 
taking the required limits. 

Once we know that the laws of (i?„(/) '■ t < T) are tight, it suffices to show 
the convergence of finite-dimensional distributions. In light of Theorem 6.2 (and 
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the Markov property of the walk), for that it is enough to prove that for alH > 0 
and Po-almost every co, 




0 


in -Po.w-probability. 


( 6 . 20 ) 


Without loss of generality, we need to do this only for f = 1. By Theorem 6.3, the 
random variable lies with probability 1 — e in the block [—M^/n, n 

Z"^, provided M sufficiently large (with “large” depending possibly on uj). Using 
Theorem 6.3(1) to estimate Po,uj{Xn = x) for x inside this block, we have 

Po,uj{\xiXn,X!)\ > 6^/n) < e + C{uj) ^ l{|x(a;,w)|>5V"} ■ (6-21) 

\x\<M^/ii 

But Theorem 5.4 tells us that, for all 6, M > 0 and Pp-almost every w, the second 
term tends to zero as n ^ oo. This proves (6.20) and the whole claim. □ 


6.3. Extension to “agile” walk 


It remains to prove Theorem 1.2 for the “agile” version of simple random walk 
on ■ Since the proof is based entirely on the statement of Theorem 1.1, we will 
resume a unified treatment of all d > 2. First we will make the observation that 
the times of the two walks run proportionally to each other: 


Lemma 6.4. Let {Tk)k>o be the stopping times defined in (1.7). Then for all f > 0 
and V^-almost every to. 


where 


T, 


[tnj 


Ot, 


Po,w-alitiost surely, 


( 6 . 22 ) 



0 



E 


— 1 } 


(6.23) 


Proof. This is an easy consequence of the second part of Theorem 3.1 and the fact 
that for Pp-almost every uj we have ^ w once x Q. Indeed, let f{uj, lo') = 
For t = 0 the statement holds trivially so let us assume that f > 0. If n is 
so large that T^nt] > 0, we have 

^ Ut"j 

7f^ = If - E (6.24) 

7 [tn] 7 Lt„j ^ 

Since T^tn] oo as n —> oo, by Theorem 3.1 the right hand side converges 
to the expectation of f{uj,TXiXi) in the annealed measure Eo(Po,w('))- ^ direct 
calculation shows that this expectation equals 0. □ 

Proof of Theorem 1.2. The proof is based on a standard approximation argument 
for stochastic processes. Let Bn{t) be as in Theorem 1.1 and recall that is a 
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linear interpolation of the values Bn{Tk/n) for k = 0, ... ,n. The path-continuity 
of the processes Bn{t) as well as the limiting Brownian motion implies that for 
every e > 0 there is a 5 > 0 such that 

Po,iu{ sup \Bn{t) - Bn{t')\ < e]> 1 - e (6.25) 

\t — t'\<S 


once n is sufficiently large. Similarly, Lemma 6.4, the continuity of t 0t and 
the monotonicity ofk^Tk imply that for n sufficiently large. 


Pf 


0,u; 


sup 

t<T 


T, 


[tnj 


-et 


< (5 > 1 - e. 


(6.26) 


On the intersection of these events, the equality B'^{k/n) = Bn{Tk/n) yields 

max <e. (6.27) 

0<fc<LTnJ I "" " 'I 

In light of piece-wise linearity this shows that, with probability at least 1 — 2e, the 
paths 1 1 -^ B'^ (t) and 1 1 —> Bn {Ot) are within a multiple of e in the supremum norm 
of each other. In particular, if Bt denotes the weak limit of the process {Bn{t ): t < 
T), then {B'nit): t < T) converges in law to {B^t- t < T). The latter is an 
isotropic Brownian motion with diffusion constant D' = D0^. □ 


A. Heat-kernel upper bounds 

Let {Zt)t>o denote the continuous-time random walk which attempts a jump to 
one of its nearest-neighbors at rate one (regardless of the number of accessible 
neighbors). Let qf{x,y) denote the probability that Zt started at a; is at y at 
time t. In his paper [3], Barlow proved the following statement: There exist con¬ 
stants 671,(72 G (0, oo) and, for each a: G a random variable S'(a;) = S{x,uj) G 
(0, oo) such that for all x,y G '^(w) and all t > S{x), 

qt{x,y) < exp{-C 2 \x - y\‘^/t}. (A.l) 

Moreover, S{x) has uniformly stretched-exponential tails, i.e., 

¥o{S{x) > R) < R>1. (A.2) 

Barlow provides also a corresponding, and significantly harder-to-prove lower 
bound which requires the additional condition t > \x — y\. However, for (A.l), 
this condition is redundant. 

In the remarks after his Theorem 1, Barlow mentions that appropriate mod¬ 
ifications to his arguments yield the corresponding discrete time estimates. Here 
we present the details of these modifications which are needed to make our proof 
of the invariance principles in Theorems 1.1 and 1.2 complete. Notice that we do 
not re-prove Barlow’s bounds in their full generality, just the absolute minimum 
necessary for our purposes. 
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A.l. Uniform bound 

There will be two kinds of bounds on the heat-kernel as a function of the terminal 
position of the walk after n steps: a uniform bound by a constant times and 

a non-uniform, Gaussian bound on the tails. We begin with the statement of the 
uniform upper bound: 

Proposition A.l. Let d > 2 and let p > pdd). There exists a random vari¬ 
able C = C(oj) with P(C' < oo) = 1 such that for all u € Oq and all x G '^oo, 

Po,LoiXn = x) < n > 1. (A.3) 

The proof will invoke the isoperimetric bound from Barlow [3]: 

Lemma A.2. There exists a constant c G (0, oo) such that for VQ-almost every oj 
and all R sufficiently laree, 

^>e|arv. (A,4, 

for all A C “^oo n [~-R) R]‘^ such that |A| > 


Proof This is a consequence of Proposition 2.11 on page 3042, and Lemma 2.13 
on page 3045 of Barlow’s paper [3]. □ 

This isoperimetric bound will be combined with the technique of evolving sets, 
developed by Morris and Peres [32], whose salient features we will now recall. 
Consider a Markov chain on a countable state-space V, let p{x, y) be the transi¬ 
tion kernel and let tt be a stationary measure. Let Q{x, y) = Tr{x)p{x, y) and for 
each Si, S 2 C V, let Q{Si, S 2 ) = set S' C C 

with finite non-zero total measure 7 r(S) we define the conductance <Ps by 


<Ps 


Q{s,sd 

7 r(S) 


For sufficiently large r, we also define the function 


<?(r) = inf{<? 5 : 7 r(S) < r}. 


(A.5) 


(A.6) 


The following is the content of Theorem 2 in Morris and Peres [32]: Suppose 
that p{x, x) > y for some 7 G (0, V 2 ] and all x &V. Let e > 0 and x,y G V.lf n 
is so large that 


n > 1 -I- 


^ r 

T J4[,r(^)A7r(y)] 


(A.7) 


then 

P'^ix,y) <e7T{y). (A. 8 ) 

Equipped with this powerful result, we are now ready to complete the proof of 
Proposition A.l: 
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Proof of Proposition A.l. First we will prove the desired bound for even times. 
Fix w S 17 and let = X 2 n be the random walk on observed only at 

even times. For each x,y € '^oo(w), let us use p{x, y) to denote the transition 
probability Px,ai(Yi = y). Let 7r(a;) denote the degree of x on ‘^oo(w)- Then tt is 
an invariant measure of this chain. Moreover, by our restriction to even times we 
have p{x, x) > (2d)~^ > 0 and so (A.7-A.8) can be applied. 

By Lemma A.2 we have that d>s > c 7 r(S)~d for some c > 0 and all sets S of 
the form S = H [—i?, RY for i? ^ 1. Hence tP(r) < c' r~i for some finite 
c' = Plugging into the integral (A.7) and using that tt is bounded, we find 

that if n > ce“ 3 , then (A. 8 ) holds. Here c is a positive constant that may depend 
on Lo. Choosing the minimal n possible, and applying y) = Px,u{yn = y), 
the bound (A. 8 ) proves the desired claim for all even times. To extend the result to 
odd times, we apply the Markov property at time one. □ 

A.2. Gaussian tails 

Next we will attend to the Gaussian-tail bound. Given the random variables S{x,uj) 
from (A.1-A.2), define random variables Nx = Nx{uj) by 

S(yP 

Nx=S{x)\/ sup 1 -r. (A.9) 

y: y^x IV ~ x\ 

Here is a restatement of the corresponding bound from Theorem 6.3: 

Proposition A.3. Let d > 2 andp > Pc{d). There exist constants Ci,C 2 G (0, oo) 
such that for all iv G Gq, all x G '^oo(w)> all R> 1 and all n > Nx{uj), 

Px,uj{Xn = y) < Ciexp{-C 2 i?^/n}. (A.IO) 

y: \y-x\>R 


Proof The proof is an adaptation of Barlow’s Theorem 1 to the discrete setting. 
Let {Xn) be the discrete time random walk, and let {Zt)t>o be the continuous time 
random walk with jumps occurring at rate 1, both started at x. We consider the cou¬ 
pling of the two walks such that they make the same moves. We will use P and E 
to denote the coupling measure and the corresponding expectation, respectively. 

Let n > Nx; and let A„ be the event that | Ar„ — a;| > R. Pick K > \ and let 


In 


/*4n 


'^{\Zt-x\>R/K} df 


(A.ll) 


be the amount of time in [n,4n] that the walk {Zt) spends at distance larger 
than ^/k from x. By the inequality 


P{An) < 


Ejln) 

E{In\An)’ 


(A.12) 


it suffices to derive an appropriate upper bound on E{In) and a matching lower 
bound on i7(/„|A„). Note that we may assume that R < n because otherwise we 
have P{An) = 0 and there is nothing to prove. 
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To derive an upper bound on i?(/„), we note that for t > n, our choice n > 
implies t > S{x). The expectation can then be bounded using (A.l); 

/ 4n 

qt{x,y)dt 

<Ci ( Y e-^=l^l'/‘df < C'4ne-'^'^^, 

^ a:: |a:| 

where C 4 and C 5 are constants (possibly depending on K). 

It thus remains to prove that, for some constant Cg > 0, 


E{In\An)>CQn. (A.14) 

To derive this inequality, let us recall that the transitions of Zt happen at rate one, 
and they are independent of the path of the walk. Hence, if Bn is the event that Zt 
attempted at least n jumps by time 2n, then P(i3„| A„) = P{Bn) is bounded away 
from zero for all n > 1. Therefore, it suffices to prove that > C^n. 

Let T be the first time when the walk (Zt) is farther from x than R. On n 
Bn, this happens before time 2n, i.e., T < 2n. Let Qr = [—R,R]‘^ n and 
Qr/k = [—^/k, ^/k]‘^ n lA. Then for values 2; on the external boundary of Qr?— 
which are those that Zt can take—the bound (A.l) tells us 

Y inax|s"'^/^exp(-iC'2i?Vs)} < 

V^Qr/k 

(A.15) 

provided that t > S{z). But our assumptions n > and R < n imply n > S{z), 
and so in light of the fact that T < 2n on n Bn, (A.15) actually holds for all t 
such that T + t G [3n, 4n]. Plugging Zt for 2 on the left-hand side and taking 
expectation gets us an upper bound on P{Zt € Qu/xlAn H Bn )—with t now 
playing the role of T + t. Hence, 

EiIn\AnnBn)> / P{Zt ^ Q R/ K\An ^ Bn) > n^l - C7 R-^) . (A.16) 

J3n 

Choosing K sufficiently large, the right-hand side grows linearly in n. □ 

Proof of Theorem 6.3. Part (1) is a direct consequence of Proposition A.l, while 
part (2) follows from Proposition A.3 and the fact that if the S{x) have stretched 
exponential tails (uniformly in x), then so do the □ 


B. Some questions and conjectures 

While our control of the corrector in c? > 3 is sufficient to push the proof of the 
functional CLT through, it is not sufficient to provide the conceptually correct 
proof of the kind we have constructed for d = 2. However, we do not see any 
reason why d >3 should be different from d = 2, so our first conjecture is: 
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Conjecture 1. Theorem 5.1 is true in all d > 2. 

Our proof of Theorem 5.1 in d = 2 hinged on the fact that the corrector plus 
the position is a harmonic function on the percolation cluster. Of interest is the 
question whether harmonicity is an essential ingredient or just mere convenience. 
Yuval Peres suggested the following generalization of Conjecture 1: 

Question 2. Let /: Z'* — > M. be a shift invariant, ergodic process on if whose 
gradients are in and have expectation zero. Is it true that 

lim — max |/(a^)| =0 (B.l) 

n^oo n £!;eZ‘*n[-n,n]‘^ 

almost surely? 

Update: The above question, while obviously true in d = 1, has a negative answer 
in all d > 2. The first counterexample, based on constructions in [43] and [11], 
was provided to us by Martin Zerner. Later Tom Liggett pointed out the following, 
embarrassingly simple, counterexample: Let f{x) be i.i.d. with distribution func¬ 
tion P{f{x) > u) = u~^ for M > 1. Then if(x))^^z‘‘ is shift-invariant, ergodic, 
with / e and the gradients of / having zero mean, yet n~^ max| 3 ,|<„ |/(x)| 
has a non-trivial distributional limit as n ^ oo. 

The harmonic embedding of has been indispensable for our proofs, but it 
also appears to be a very interesting object in its own right. This motivates many 
questions about the corrector x(a;, to). Unfortunately, at the moment it is not even 
clear what properties make the corrector unique. The following question has been 
asked by Scott Sheffield: 

Question 3. Is it true that, for a.e. to G Qq, there exists only one vector-valued 
function x ‘^) on (w) such that x x-\-x{x, w) is harmonic on (oj), 

x(0,oj) = 0 and x(a;, u;)/|a;| ^ 0 as \x\ oo? 

If this question is answered in the affirmative, we could generate the corrector 
by its finite-volume approximations (this would also fully justify Fig. 1). If we re¬ 
strict ourselves to functions that have the shift-invariance property ( 2 . 12 ), unique¬ 
ness can presumably be shown using the “electrostatic methods” from, e.g., [18]. 
However, it is not clear whether (2.12) holds for the corrector defined by the ther¬ 
modynamic limit from finite boxes. 

As to the more detailed properties of the corrector, for the purposes of the 
present work one would like to know how xix,u}) scales with x and whether it 
has a well-defined scaling limit. We believe that, in sufficiently high dimension, 
the corrector is actually tight: 

Conjecture 4. Let d ^ 1. Then for each e > 0 there exists K < oo such that 
Po( |x(a;,u;)| > K \ x & Ifoo) < e for all x € Z'^. 

It appears that one might be able to prove Conjecture 4 by using Barlow’s heat- 
kernel estimates. To capture the behavior in low dimensions, we make a somewhat 
wilder guess: 
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Conjecture 5. Let d > 1. Then the law of x e^~ x([VeJ) on compact subsets 
ofMf converges weakly (as e J, Q) to Gaussian Free Field, i.e., a multivariate 
Gaussian field with covariance proportional to A~^ 1, where A is the Dirichlet 
Laplacian on and 1 is the d-dimensional unit matrix. 

Here is a heuristic reasoning that led us to these conjectures: Consider the 
problem of random conductances to avoid problems with conditioning on contain¬ 
ment in the infinite cluster. To show the above convergence, we need that for any 
smooth /: ^ R. with compact support, 

e‘'^(Z\/)(xe)e^x(a:) ^ M{Q,a^\\Vf\\ll), (B.2) 


where V and A denote the (continuous) gradient and Laplacian, respectively, and 
where N{0, C) is a mean-zero, covariance-C multivariate normal random vari¬ 
able. Next we note that the corrector is defined, more or less, as the solution to the 
equation Z\dX = —V, where V is the local drift and Z\d is the relevant generator, 
which is basically a discrete Laplacian on Z"^. Thus, if g: R'^ ^ R is smooth with 
compact support and ge{x) = g{xe), then 


^ e ^(Aig,){x)x{x) =^ g^{x)V(x) 


= e ‘^/2 


T.— - , (B.3) 


e: |e| = l 

^ AA(0,||Vp||2l) 


The convergence statement (B.2) would then follow from (B.3) provided we can 
replace the “discretized” Laplacian e^^ZldPe by its continuous counterpart Ag. 

Note that for d = 1 and conductances bounded away from zero. Conjecture 5 
is actually a theorem. Indeed, the corrector is a random walk with increments given 
by reciprocal conductances and so the convergence follows by the invariance prin¬ 
ciple for random walks. Conjecture 5 suggests that Conjecture 4 applies for d> 3. 

Despite the emphasis on the harmonic embedding of ‘rf’ao, our proofs used, 
quite significantly, the underlying group structure of Z'^; e.g., in Sect. 4. Presum¬ 
ably this will not prevent application of our method to other regular lattices, but for 
more irregular graphs, e.g., Voronoi percolation in R"^, significant changes may be 
necessary. A similar discussion applies to various natural subdomains of Z"^; for 
instance, it is not clear how to adapt our proof to random walk on the infinite 
percolation cluster in the half-space {x € : Xd > 0}. 

A different direction of generalizations are the models of long-range percola¬ 
tion with power-law decay of bond probabilities. Here we conjecture: 


Conjecture 6. Let d > f ond consider long-range percolation obtained by adding 
to Zf a bond between every two distinct sites x,y G independently with proba¬ 
bility proportional to |a; — If a G (0, 2), then the corresponding random 

walk scales to a symmetric a-stable Levy process in R"^. 
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Note that, according to this conjecture, in c? = 1, the interval a G (0, 2) of 
“interesting” exponents is larger than the interval for which an infinite connected 
component may occur even without the “help” of nearest neighbor connections. 
On the other hand, in dimensions d> 3, the interval conjectured for stable conver¬ 
gence is strictly smaller than that of “genuine” long-range percolation behavior, 
as defined, e.g., in terms of the scaling of graph distance with Euclidean distance; 
cf [4,5,8]. 
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